SEQUENCE LISTING 
(1) GENERAL INFORMATION: 
(i) APPLICANT: RABIN, Mark B. 

(ii) TITLE OF INVENTION: MUTATIONS IN THE BRCA1 GENE 

(iii) NUMBER OF SEQUENCES: 10 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Howrey & Simon 

(B) STREET: 12 99 Pennsylvania Avenue, N.W. 

(C) CITY: Washington 

(D) STATE: District of Columbia 

(E) COUNTRY: USA 

(F) ZIP: 20004 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE : Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 

(D) SOFTWARE: FastSEQ for Windows Version 2.0b 

(vi) CURRENT APPLICATION DATA: . 

(A) APPLICATION NUMBER: 09/038,946 

(B) FILING DATE: March 12, 1998 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 



(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Halluin, Albert P. 

(B) REGISTRATION NUMBER: 25,227 

(C) REFERENCE/DOCKET NUMBER : 05371,0032.999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-3.83-7073 

(B) TELEFAX: 202-383-6610 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5711 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 60 

CCTGCGCTCA GGAGGC CTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAAA 12 0 

TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT ATGCAGAAAA 18 0 

TCTTAGAGTG TCCCATCTGT CTGGAGTTGA TCAAGGAACC TGTCTCCACA AAGTGTG AC C 240 

ACATATTTTG CAAATTTTGC ATGCTGAAAC TTCTCAACCA GAAGAAAGGG CCTTCACAGT 3 00 

GTCCTTTATG TAAGAATGAT ATAACCAAAA GG AG C C T AC A AG AAAGT AC G AGATTTAGTC 3 60 

AACTTGTTGA AGAGCTATTG AAAATCATTT GTGCTTTTCA GCTTGACACA GGTTTGGAGT 42 0 

ATGCAAACAG CTATAATTTT GCAAAAAAGG AAAATAACTC TCCTGAACAT CTAAAAGATG 48 0 

AAGTTTCTAT CATCCAAAGT ATGGGCTACA GAAACCGTGC CAAAAGACTT CTACAGAGTG 54 0 

AACCCGAAAA TCCTTCCTTG CAGGAAACCA GTCTCAGTGT CCAACTCTCT AACCTTGGAA 600 

CTGTGAGAAG TCTGAGGACA AAGCAGCGGA T AC AAC C T C A AAAGACGTCT GTCTACATTG 660 

AATTGGGATC TGATTCTTCT GAAGATACCG TTAATAAGGC AACTTATTGC AGTGTGGGAG 72 0 

ATCAAGAATT GTTACAAATC ACCCCTCAAG GAACCAGGGA TGAAATCAGT TTGGATTCTG 7 80 

CAAAAAAGGC TGCTTGTGAA TTTTCTGAGA CGGATGTAAC AAATACTGAA CATCATCAAC 84 0 

CCAGTAATAA TGATTTGAAC AC C AC T GAGA AGCGTGCAGC TGAGAGGCAT CCAGAAAAGT 90 0 

ATCAGGGTAG TTCTGTTTCA AACTTGCATG TGGAGCCATG TGGCACAAAT ACTCATGCCA 960 

G C T C ATT AC A GCATGAGAAC AG C AGTTT AT TACTCACTAA AGACAGAATG AATGTAGAAA 102 0 

AGGCTGAATT CTGTAATAAA AGCAAACAGC CTGGCTTAGC AAGGAGCCAA CATAACAGAT 10 80 

GGGCTGGAAG TAAGGAAACA TGTAATGATA GGCGGACTCC CAGCACAGAA AAAAAGGTAG 114 0 

ATCTGAATGC TGATCCCCTG TGTGAGAGAA AAGAATGGAA T AAG C AG AAA CTGCCATGCT 12 00 

CAGAGAATCC TAGAGATACT GAAGATGTTC CTTGGATAAC AC TAAAT AG C AG CAT T C AGA 12 60 

AAGTTAATGA GTGGTTTTCC AGAAGTGATG AACTGTTAGG TTCTGATGAC TCACATGATG 13 2 0 

GGGAGTCTGA ATCAAATGCC AAAGT AG CTG ATGTATTGGA CGTTCTAAAT GAGGTAGATG 13 8 0 

AATATTCTGG TTCTTCAGAG AAAAT AG AC T TACTGGCCAG TGATCCTCAT GAGGCTTTAA 144 0 

TATGTAAAAG TGAAAGAGTT CACTCCAAAT CAGTAGAGAG TAATATTGAA GACAAAATAT 15 00 

TTGGGAAAAC C TAT C GG AAG AAGGCAAGCC TCCCCAACTT AAGCCATGTA ACT G AAAAT C 15 60 

TAATTATAGG AGCATTTGTT ACTGAGCCAC AGATAATACA AGAGCGTCCC CTCACAAATA 16 2 0 

AATTAAAGCG TAAAAGGAGA C CT AC AT C AG GCCTTCATCC TGAGGATTTT ATCAAGAAAG 16 8 0 

CAGATTTGGC AGTTCAAAAG ACTCCTGAAA TGATAAATCA GGGAACTAAC CAAACGGAGC 174 0 

AGAATGGTCA AGTGATGAAT AT T ACTAAT A GTGGTCATGA GAATAAAACA AAAGGTG AT T 18 0 0 

CTATTCAGAA TGAGAAAAAT CCTAACCCAA TAGAATCACT CGAAAAAGAA TCTGCTTTCA 18 60 

AAACGAAAGC TGAACCTATA AG C AG C AGTA TAAGCAATAT GGAACTCGAA T TAAAT AT C C 192 0 

ACAATTCAAA AG C AC CTAAA AAGAATAGGC TGAGGAGGAA GTCTTCTACC AGG CAT AT T C 19 8 0 

ATGCGCTTGA ACTAGTAGTC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTGCAAA 204 0 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA ATGCCAGTCA 210 0 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GC C AAG AAG A 2160 

GTAACAAGCC AAATGAACAG ACAAGTAAAA GACATGACAG TGATACTTTC CCAGAGCTGA 22 2 0 

AGTTAACAAA TGCACCTGGT T CTTTT AC T A AGTGTTCAAA T AC C AGTGAA CTTAAAGAAT 22 8 0 

TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 234 0 

CTAATAATGC TGAAGACCCC AAAGATCTCA TGTTAAGTGG AGAAAGGGTT TTGCAAACTG 24 0 0 

AAAGATCTGT AG AG AGT AG C AGTATTTCAC TGGTACCTGG TACTGATTAT GGCACTCAGG 24 60 

AAAGT AT CT C GTTACTGGAA GTTAGCACTC TAGGGAAGGC AAAAACAGAA CCAAATAAAT 252 0 

GTGTGAGTCA GTGTGCAGCA TTTGAAAACC C C AAGGG ACT AATTCATGGT TGTTCCAAAG 25 8 0 

ATAATAGAAA TGACACAGAA GGCTTTAAGT ATCCATTGGG ACATGAAGTT AAC C AC AGT C 264 0 

GGGAAACAAG CATAGAAATG GAAGAAAGTG AACTTGATGC TCAGTATTTG CAGAATACAT 27 0 0 

TCAAGGTTTC AAAGCGCCAG TCATTTGCTC TGTTTTCAAA TCCAGGAAAT GCAGAAGAGG 2760 

AATGTGCAAC ATTCTCTGCC CACTCTGGGT CCTTAAAGAA ACAAAGTCCA AAAGT C AC TT 2 82 0 

TTGAATGTGA ACAAAAGGAA G AAAAT C AAG GAAAGAATGA GTCTAATATC AAGCCTGTAC 28 80 

AGACAGTTAA TATCACTGCA GGCTTTCCTG TGGTTGGTCA GAAAGATAAG C C AGTTGAT A 2 94 0 

ATGCCAAATG T AGT AT C AAA GGAGGCTCTA GGTTTTGTCT ATCATCTCAG TTCAGAGGCA 3 00 0 

- ACGAAACTGG ACTCATTACT CCAAATAAAC ATGGACTTTT ACAAAACCCA TATCGTATAC 3 060 

C AC CACTTTT T C C CAT C AAG TCATTTGTTA AAACTAAATG TAAGAAAAAT C TG C TAG AGG 312 0 

AAAACTTTGA GGAACATTCA ATGTCACCTG AAAGAGAAAT GGGAAATGAG AAC AT T C C AA 3180 

GT AC AGT GAG CACAATTAGC CGTAATAACA TTAGAGAAAA TGTTTTTAAA GGAGCCAGCT 3 240 

CAAGCAATAT TAATGAAGTA GGTTCCAGTA CTAATGAAGT GGGCTCCAGT ATTAATGAAA 3 3 00 
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TAGGTTCCAG TGATGAAAAC ATTCAAGCAG AACTAGGTAG AAACAGAGGG CCAAAATTGA 33 60 

ATGCTATGCT TAG AT T AGG G GTTTTGCAAC CTGAGGTCTA TAAACAAAGT CTTCCTGGAA 3 4 2*0 

GTAATTGTAA GCATCCTGAA ATAAAAAAGC AAGAATATGA AGAAGTAGTT GAG AC T GTT A 34 8 0 

ATACAGATT-T CTCTCCATAT CTGATTTCAG ATAACTTAGA ACAGCCTATG GGAAGTAGTC 3 54 0 

ATGCATCTCA GGTTTGTTCT GAGACACCTG ATGACCTGTT AG ATGATG GT GAAATAAAGG 3 60 0 

AAG AT AC TAG TTTTGGTGAA AATGACATTA AGGAAAGTTC TGCTGTTTTT AGCAAAAGCG 3 6 60 

TC C AG AG AGG AGAGCTTAGC AGGAGTCCTA GCCCTTTCAC C CAT AC AC AT TTGGCTCAGG 37 2 0 

GTTACCGAAG AGGGGCCAAG AAATTAGAGT C C T C AG AAG A GAACTTATCT AGTGAGGATG 37 80 

AAGAGCTTCC CTGCTTCCAA CACTTGTTAT TTGGTAAAGT AAACAATATA CCTTCTCAGT 3 84 0 

CTACTAGGCA TAGCACCGTT GCTACCGAGT GTCTGTCTAA GAACACAGAG GAGAATTTAT 3 9 00 

TATCATTGAA GAATAGCTTA AATGACTGCA GTAACCAGGT AATATTGGCA AAGGCATCTC 3 9 60 

AGGAAC AT C A CCTTAGTGAG GAAACAAAAT GTTCTGCTAG CTTGTTTTCT TCACAGTGCA 4 02 0 

GTGAATTGGA AGACTTGACT GCAAATACAA ACACCCAGGA TCCTTTCTTG ATTGGTTCTT 40 8 0 

CCAAACAAAT G AGG CAT C AG TCTGAAAGCC AGGGAGTTGG TCTGAGTGAC AAGGAATTGG 414 0 

TTTCAGATGA TGAAGAAAGA GGAACGGGCT TGGAAGAAAA TAATCAAGAA GAG C AAAGC A 42 0 0 

TGGATTCAAA CTTAGGTGAA GCAGGATCTG GGTGTGAGAG TGAAACAAGC GTCTCTGAAG 42 6 0 

ACTGCTCAGG GCTATCCTCT CAGAGTGACA TTTTAACCAC TCAGCAGAGG GATACCATGC 43 2 0 

AACATAACCT GATAAAGCTC CAGCAGGAAA TGGCTGAACT AGAAGCTGTG TTAGAACAGC 43 80 

ATGGGAGCCA GCCTTCTAAC AGCTACCCTT C CAT CAT AAG TGACTCCTCT GCCCTTGAGG 444 0 

AC C TG CGAAA TCCAGAACAA AGCACATCAG AAAAAGCAGT ATTAACTTCA CAGAAAAGTA 4 5 00 

GTGAATACCC TAT AAG C C AG AATCCAGAAG GCCTTTCTGC TGACAAGTTT GAGGTGTCTG 45 60 

CAGATAGTTC T AC C AGT AAA AATAAAGAAC CAGGAGTGGA AAG GT CAT C C CCTTCTAAAT 4 62 0 

GCCCATCATT AGATGATAGG TGGTACATGC ACAGTTGCTC TGGGAGTCTT CAGAATAGAA 4 68 0 

ACTACCCATC T C AAG AGG AG C TC ATT AAG G TT GTT GAT GT GGAGGAGCAA CAGCTGGAAG 474 0 

AGTCTGGGCC ACACGATTTG ACGGAAACAT CTTACTTGCC AAGGCAAGAT CTAGAGGGAA 4 8 00 

CCCCTTACCT GGAATCTGGA ATCAGCCTCT TCTCTQATGA CCCTGAATCT GATCCTTCTG 4860 

AAG AC AG AG C C C C AG AGT C A GCTCGTGTTG G C AAC AT AC C ATCTTCAACC TCTGCATTGA 4 92 0 

AAGTTCCCCA ATTGAAAGTT GCAGAATCTG CCCAGGGTCC AGCTGCTGCT CAT ACT AC TG 4 98 0 
ATACTGCTGG GTATAATGCA ATGGAAGAAA GTGTGAGCAG GG AG AAG CCA GAATTGACAG 5040 
CTTCAACAGA AAGGGTCAAC AAAAGAATGT CCATGGTGGT GTCTGGCCTG ACCCCAGAAG 510 0 
AATTTATGCT CGTGTACAAG TTTGCCAGAA AACACCACAT CACTTTAACT AATCTAATTA 516 0 
CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 52 2 0 
TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 52 8 0 
AGTCTATTAA AGAAAGAAAA ATGCTGAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 5340 
TCAATGGAAG AAACCACCAA GGTCC AAAGC GAGCAAGAGA ATCCCAGGAC AGAAAGATCT 54 0 0 
TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGCCCACA G AT C AAC TG G 5 46 0 
AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 5 52 0 
GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 55 8 0 
TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 5 640. 
GTGTAGCACT CTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 57 0 0 
GCCACTACTG A 5711 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1863 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Ala Met Gin Lys lie Leu Glu Cys Pro lie Cys Leu Glu Leu lie Lys 
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20 25 30 

Glu Pro Val Ser Thr Lys Cys Asp His lie Phe Cys Lys Phe Cys Met 

35 40 45 

Leu Lys Leu Leu Asn Gin Lys Lys Gly Pro Ser Gin Cys Pro Leu Cys 

50 55 60 

Lys Asn Asp He Thr Lys Arg Ser Leu Gin Glu Ser Thr Arg Phe Ser 
65 70 75 80 

Gin Leu Val Glu Glu Leu Leu Lys He lie Cys Ala Phe Gin Leu Asp 

85 90 95 

Thr Gly Leu Glu Tyr Ala Asn Ser Tyr Asn Phe Ala Lys Lys Glu Asn 

100 105 110 

Asn Ser Pro Glu His Leu Lys Asp Glu Val Ser He He Gin Ser Met 

115 120 125 

Gly Tyr Arg Asn Arg Ala Lys Arg Leu Leu Gin Ser Glu Pro Glu Asn 

130 135 140 

Pro Ser Leu Gin Glu Thr Ser Leu Ser Val Gin Leu Ser Asn Leu Gly 
145 150 155 160 

Thr Val Arg Thr Leu Arg Thr Lys Gin Arg He Gin Pro Gin Lys Thr 

165 170 175 

Ser Val Tyr He Glu Leu Gly Ser Asp Ser Ser Glu Asp Thr Val Asn 

180 185 190 

Lys Ala Thr Tyr Cys Ser Val Gly Asp Gin Glu Leu Leu Gin He Thr 

195 200 205 " 

Pro Gin Gly Thr Arg Asp Glu He Ser Leu Asp Ser Ala Lys Lys Ala 

210 215 220 

Ala Cys Glu Phe Ser Glu Thr Asp Val Thr Asn Thr Glu His His Gin 
225 230 235 240 

Pro Ser Asn Asn Asp Leu Asn Thr Thr Glu Lys Arg Ala Ala Glu Arg 

245 250 255 

His Pro Glu Lys Tyr Gin Gly Ser Ser Val Ser Asn Leu His Val Glu 

260 265 270 

Pro Cys Gly Thr Asn Thr His Ala Ser Ser Leu Gin His Glu Asn Ser 

275 280 285 

Ser Leu Leu Leu Thr Lys Asp Arg Met Asn Val Glu Lys Ala Glu Phe 

290 295 300 

Cys Asn Lys Ser Lys Gin Pro Gly Leu Ala Arg Ser Gin His Asn Arg 
305 310 315 320 

Trp Ala Gly Ser Lys Glu Thr Cys Asn Asp Arg Arg Thr Pro Ser Thr 

325 330 335 

Glu Lys Lys Val Asp Leu Asn Ala Asp Pro Leu Cys Glu Arg Lys Glu 

340 345 350 

Trp Asn Lys Gin "Lys Leu Pro Cys Ser Glu Asn Pro Arg Asp Thr- Glu 

355 360 365 

Asp Val Pro Trp He Thr Leu Asn Ser Ser He Gin Lys Val Asn Glu 

370 375 380 

Trp Phe Ser Arg Ser Asp Glu Leu Leu Gly Ser Asp Asp Ser His Asp 
385 390 395 400 

Gly Glu Ser Glu Ser Asn Ala Lys Val Ala Asp Val Leu Asp Val Leu 

405 410 415 

Asn Glu Val Asp Glu Tyr Ser Gly Ser Ser Glu Lys He Asp Leu Leu 

420 425 430 

Ala Ser Asp Pro His Glu Ala. Leu He Cys Lys Ser. Glu Arg Val His 

435 440 445 

Ser Lys Ser Val Glu Ser Asn He Glu Asp Lys He Phe Gly Lys Thr 

450 455 460 

Tyr Arg Lys Lys Ala Ser Leu Pro Asn Leu Ser His Val Thr Glu Asn 
465 470 475 480 
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Leu lie lie Gly 

Pro Leu Thr Asn 
500 

His Pro Glu Asp 
515 

Pro Glu Met lie 
530 

Val Met Asn lie 
545 

Ser lie Gin Asn 

Glu Ser Ala Phe 
580 

Asn Met Glu Leu 
595 

Asn Arg Leu Arg 
610 

Leu Val Val Ser 
625 

lie Asp Ser Cys 

Gin Met Pro Val 
660 

Glu Pro Ala Thr 
675 

Ser Lys Arg His 
690 

Ala Pro Gly Ser 
705 

Phe Val Asn Pro 

Thr Val Lys Val 
740 

Ser Gly Glu Arg 
755 

lie Ser Leu Val 
770 

Leu Leu Glu Val 
785 

Cys Val Ser Gin 

Gly Cys Ser Lys 
820 

Leu Gly His Glu 
835 

Glu Ser Glu Leu 
850 

Lys Arg Gin Ser 
865 

Glu Cys Ala Thr 

Pro Lys Val Thr 
900 

Asn Glu Ser Asn 
915 

Phe Pro Val Val 



Ala Phe Val Thr 
485 

Lys Leu Lys Arg 

Phe He Lys Lys 
520 

Asn Gin Gly Thr 
535 

Thr Asn Ser Gly 
550 

Glu Lys Asn Pro 
565 

Lys Thr Lys Ala 

Glu Leu Asn He 
600 

Arg Lys Ser Ser 
615 

Arg Asn Leu Ser 
630 

Ser Ser Ser Glu 
645 

Arg His Ser Arg 

Gly Ala Lys Lys 
680 

Asp Ser Asp Thr 
695 

Phe Thr Lys Cys 
710 

Ser Leu Pro Arg 
725 

Ser Asn Asn Ala 

Val Leu Gin Thr 
760 

Pro Gly Thr Asp 
775 

Ser Thr Leu Gly 
790 

Cys Ala Ala Phe 
805 

Asp Asn Arg Asn 

Val Asn His Ser 
840 

Asp Ala Gin Tyr 
855 

Phe Ala Leu Phe 
870 

Phe Ser Ala His 
885 

Phe Glu Cys Glu 

lie Lys Pro Val 
920 

Gly Gin Lys Asp 



Giu Pro Gin He 
490 

Lys Arg Arg Pro 
505 

Ala Asp Leu Ala 

Asn Gin Thr Glu 
540 

His Glu Asn Lys 
555 

Asn Pro He Glu 
570 

Glu Pro He Ser 
585 

His Asn Ser Lys 

Thr Arg His He 
620 

Pro Pro Asn Cys 

635 • 
Glu He Lys Lys 
650 

Asn Leu Gin Leu 
665 

Ser Asn Lys Pro 

Phe Pro Glu Leu 
700 

Ser Asn Thr Ser 
715 

Glu Glu Lys Glu 
730 

Glu Asp Pro Lys 
745 

Glu Arg Ser Val 

Tyr Gly Thr Gin 
780 

Lys Ala Lys Thr 
795 

Glu Asn Pro Lys 
810 

Asp Thr Glu Gly 
825 

Arg Glu Thr Ser 

Leu Gin Asn Thr 
860 

Ser Asn Pro Gly 
875 

Ser Gly Ser Leu 
890 - 

Gin Lys Glu Glu 
905 

Gin Thr Val Asn 
Lys Pro Val Asp 



lie Gin Glu Arg 
495 

Thr Ser Gly Leu 
510 

Val Gin Lys Thr 
525 

Gin Asn Gly Gin 

Thr Lys Gly Asp 
560 

Ser Leu Glu Lys 
575 

Ser Ser He Ser 
590 

Ala Pro Lys Lys 
605 

His Ala Leu Glu 

Thr Glu Leu Gin 
640 

Lys Lys Tyr Asn 
655 

Met Glu Gly Lys 
670 

Asn Glu Gin Thr 
685 

Lys Leu Thr Asn 

Glu Leu Lys Glu 
720 

Glu Lys Leu Glu 
735 

Asp Leu Met Leu 
750 

Glu Ser Ser Ser 
765 

Glu Ser He Ser 

Glu Pro Asn Lys 
800 

Gly Leu He His 
815 

Phe Lys Tyr Pro 
830 

He Glu Met Glu 
845 

Phe Lys Val Ser 

Asn Ala Glu Glu 
680 

Lys Lys Gin Ser 
895 

Asn Gin Gly Lys 
910 

He Thr Ala Gly 
925 

Asn Ala Lys Cys 
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930 935 940 

Ser lie Lys Gly Gly Ser Arg Phe Gys Leu- Ser Ser Gin Phe Arg Gly 
945 950 955 960 

Asn Glu Thr Gly Leu lie Thr Pro Asn Lys His Gly Leu Leu Gin Asn 

965 970 975 

Pro Tyr Arg lie Pro Pro Leu Phe Pro lie Lys Ser Phe Val Lys Thr 

980 985 990 

Lys Cys Lys Lys Asn Leu Leu Glu Glu Asn Phe Glu Glu His Ser Met 

995 1000 1005 

Ser Pro Glu Arg Glu Met Gly Asn Glu Asn lie Pro Ser Thr Val Ser 

1010 1015 1020 

Thr lie Ser Arg Asn Asn lie Arg Glu Asn Val Phe Lys Gly Ala Ser 
1025 1030 , 1035 104 

Ser Ser Asn lie Asn Glu Val Gly Ser Ser Thr Asn Glu Val Gly Ser 

1045 1050 1055 

Ser lie Asn Glu He Gly Ser Ser Asp Glu Asn He Gin Ala Glu Leu 

1060 1065 1070 

Gly Arg Asn Arg Gly Pro Lys Leu Asn Ala Met Leu Arg Leu Gly Val 

1075 1080 1085 

Leu Gin Pro Glu Val Tyr Lys Gin Ser Leu Pro Gly Ser Asn Cys Lys 

1090 1095 1100 

His Pro Glu He Lys Lys Gin Glu Tyr Glu Glu Val Val Gin Thr Val 
1105 1110 1115 112 

Asn Thr Asp Phe Ser Pro Tyr Leu He Ser Asp Asn Leu Glu Gin Pro 

li25 1130 1135 

Met Gly Ser Ser His Ala Ser Gin Val Cys Ser Glu Thr Pro Asp Asp 

1140 1145 1150 

Leu Leu Asp Asp Gly Glu He Lys Glu Asp Thr Ser Phe Ala Glu Asn 

1155 1160 1165 

Asp He Lys Glu Ser Ser Ala Val Phe Ser Lys Ser Val Gin Arg Gly 

1170 1175 1180 

Glu Leu Ser Arg Ser Pro Ser Pro Phe Thr His Thr His Leu Ala Gin 
1185 1190 1195 120 

Gly Tyr Arg Arg Gly Ala Lys Lys Leu Glu Ser Ser Glu Glu Asn Leu 

1205 1210 1215 

Ser Ser Glu Asp Glu Glu Leu Pro Cys Phe Gin His Leu Leu Phe Gly 

1220 1225 1230 

Lys Val Asn Asn He Pro Ser Gin Ser Thr Arg His Ser Thr Val Ala 

1235 " 1240 1245 

Thr Glu Cys Leu Ser Lys Asn Thr Glu Glu Asn Leu Leu Ser Leu Lys 

1250 1255 1260 

Asn Ser Leu Asn Asp Cys Ser Asn Gin Val He Leu Ala Lys Ala Ser 
1265 1270 1275 128 

Gin Glu His His Leu Ser Glu Glu Thr Lys Cys Ser Ala Ser Leu Phe 

1285 1290 1295 

Ser Ser Gin Cys Ser Glu Leu Glu Asp Leu Thr Ala Asn Thr Asn Thr 

1300 1305 1310 

Gin Asp Pro Phe Leu He Gly Ser Ser Lys Gin Met Arg His Gin Ser 

1315 1320 1325 

Glu Ser Gin Gly Val Gly Leu Ser Asp Lys Glu Leu Val Ser Asp Asp 

1330 1335 1340 

Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
1345 1350 1355 136 

Met Asp Ser Asn Leu Gly Glu Ala Ala Ser Gly Cys Glu Ser Glu Thr 

1365 1370 1375 

Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp He Leu 
1380 1385 1390 
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Thr Thr Gin Gin Arg Asp Thr Met Gin His Asn Leu lie Lys Leu Gin 

1395 1400 1405 

Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 

1410 - 1415 1420 

Pro Ser Asn Ser Tyr Pro Ser lie lie Ser Asp Ser Ser Ala Leu Glu 
1425 1430 1435 144 

Asp Leu Arg Asn Pro Glu Gin Ser Thr Ser Glu Lys Ala Val Leu Thr 

1445 1450 1455 

Ser Gin Lys Ser Ser Glu Tyr Pro lie Ser Gin Asn Pro Glu Gly Leu 

1460 1465 1470 

Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 

1475 1480 1485 

Lys Glu Pro Gly Val Glu Arg Ser Ser Pro Ser Lys Cys Pro Ser Leu 

1490 1495 1500 

Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 
1505 1510 1515 152 

Asn Tyr Pro Ser Gin Glu Glu Leu lie Lys Val Val Asp Val Glu Glu 

1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 

1540 1545 1550 

Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly lie 

1555 1560 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 

1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn lie Pro Ser Ser Thr Ser Ala Leu 
1585 1590 1595 160 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Gly Pro Ala Ala 

1605 1610 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 

1620 1625 1630 

Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn Lys 

1635 1640 1645 

Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 

1650 1655 1660 

Val Tyr Lys Phe Ala Arg Lys His His lie Thr Leu Thr Asn Leu lie 
1665 1670 1675 168 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 

1685 1690 1695 

Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly lie Ala Gly Gly Lys Trp 

1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr Gin Ser lie Lys Glu Arg Lys Met 
1715 1720 1725 

. Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 
1730 1735 1740 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys lie 
1745 1750 1755 176 

Phe Arg Gly Leu Glu lie Cys Cys Tyr Gly Pro . Phe Thr Asn Met Pro 

1765 1770 1775 

Thr Asp Gin Leu Glu Trp Met Val Gin Leu Cys Gly Ala Ser Val Val 

1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro lie Val 

1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala lie 

1810 1815 1820 

Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
1825 1830 1835 184 

Ser Val Ala Leu Tyr Gin Cys Gin Glu Leu Asp Thr Tyr Leu lie Pro 
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1845 



1850 



1855 



Gin lie Pro His Ser His Tyr 
1860 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
AATCTTAGAG TGTCCCA 17 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATCTTAGTGT CCCACCT 17 
(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CAGAAAAAAA GGTAGAT 17 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
■(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CAGAAAAAAA AGGTAGA 17 



(2) INFORMATION FOR SEQ ID NO : 7 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 7 : 
AGAGAATCCC AGGACAG 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
AGAGAATCCC CAGGACA 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TO POLOG Y : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 9: 
AGGACCTGCG AAATCCA 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
AGGACCTGTG AAATCCA 
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